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18.152 Introduction to PDEs, Fall 2011 Professor: Jared Speck 

Class Meeting #21: Lagrangian Field Theories 

"Many of the PDEs of interest to us can be realized as the Euler- Lagrange equations corresponding 
to a function known as a Lagrangian C Closely related are the notions of the action corresponding 
to the Lagrangian, and the notion of a stationary point of the action. These ideas fall under a 
branch of mathematics known as the calculus of variations. As we will see, these ideas will provide 
a framework for deriving conserved (and more generally almost-conserved) quantities for solutions 
to the Euler-Lagrange equations, the availability of which plays a central role in the analysis of 
these solutions. Some important examples of PDEs to which these methods apply include the 
familiar linear wave equation. Maxwell's equations of electromagnetism, the Euler equations of 
fluid mechanics, and the Einstein equations of general relativity. 

1. Variational formulation (The Action Principle) 

In this section, we will study (scalar-valued) functions (f) on R-^''""'. They are sometimes called 
(scalar- valued) fields on M^+"^. We will use the notation 

(1.0.1) x= (x°,x\--- ,x") 

to denote the standard coordinates on R^+", and as usual, we will sometimes use the alternate 
notation = t. We will use the notation 

(1.0.2) V0= (Vt0,Vi0,-- - ,V„0) 

to denote the spacetime gradient of 0. We will study PDEs that are (in a sense to be explained) 
generated by a Lagrangian. 

Definition 1.0.1 (Lagrangian). A Lagrangian £ is a function of and V0 (and sometimes the 
spacetime coordinates x and perhaps other quantities too). We indicate the dependence of C on 
e.g. (p ^-iid V0 by writing 

(1.0.3) /:(0,V0). 

Example 1.0.1. As we will see, C =^ |(m^^)"''VQ0V/30, is the Lagrangian corresponding to the 
linear wave equation, where — diag(— 1, 1, 1, • • • , 1) is the standard Minkowski metric. 

Given a Lagrangian C and a compact subset of spacetime we can define an important functional 
known as the action. The action inputs functions (f) and outputs a real number. 

Definition 1.0.2 (Action). Let ^ C R^+" be a compact subset of spacetime. We define the action 
A (j) over the set ^ by 
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(1.0.4) A[(t);^]= [ £(0(x),V0(x))c/^+"a;. 

Above, d^~^"'x '= dtdx^dP ■ ■ ■ dx^ denotes spacetime integration. We often omit the argument x oi (f) 
and V0. 

A main theme that runs throughout this section is that it is possible to generahze certain aspects 
of standard calculus, which takes place on M^"*"", to apply to (infinite dimensional) spaces of func- 
tions. In this context, the action A plays the same role that a function plays in standard calculus. 
Moreover, many important PDEs have solutions that are stationary point^ for the action. The 
notion of a stationary point is a generalization of the notion of a critical point from calculus. In 
order to define a stationary point of we will need to introduce the notion of a variation. The 
motivation behind the next two definitions is that we would like to understand how ^[0; ^] changes 
when we slightly change 0. 

Definition 1.0.3 (Variation). Given a compact set ^, a function ip G C^(^) is called a variation. 
Definition 1.0.4. Given a variation ip and a small number e, we define 



(1.0.5) 0/=0+ ^ 

tiny perturbation of 

We now give the definition of a stationary point of the action. Stationary points are the moral 
equivalent of critical point^ from calculus. 

Definition 1.0.5 (Definition of a stationary point 0). We say that is a stationary point of 

the action if the following relation holds for all compact subsets ^ and all variations ip G C^(^) : 

(1.0.6) ^ ^[0,;j^]=O. 

de e=o 

The next theorem is central to our discussion in this section. It shows that the stationary points 
of A verify a PDE called the Euler- Lagrange equation. 

Theorem 1.1 (The Principle of Stationary Action). Let £(0, V0, x) be a Lagrangian. 
Then a field (j) is a stationary point of the action if and only if the following Euler- Lagrange 
PDE is verified by : 



(1.0.7) 



/9£(0,V0,x)\ 9£(0,V0,x) 



/ g£(0,V0,x) \ 
V 5(V«0) J 



Above, ^^Q^^^^^^ denotes partial differentiation of C with respect to its argument Va0 with its other 
arguments (e.g., the other V^0 with fi ^ a, (j), x, etc.) held fixed. 

Proof. Let ^ C M^^" be a compact subset of spacetime and let ip be any variation with support 
contained in ^. For any e > 0, we define as in (1.0.5): 0^ =^ + eijj- We then differentiate under the 
integral and use the chain rule to conclude that 



""^Evcn though they are called "stationary points," they are actually fields on 
^Recall that a: is a critical point of the function / if /'(.t) = 0. 
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1.0.81 



C{(j)e,V(f)e,x)d 



1+n 



X 



d^C{(f)e, V0£, x) d 



1+n 



X 



d, 



dC{(f)„V(f)„x) 




d^+^'x. 



Above, (9e denotes the derivative with respect to the parameter e with all other variables held fixed. 
We now set e = 0, integrate by parts in (1.0.8) (and observe that the conditions on ip guarantee 
that there are no boundary terms) to deduce that 



(1.0.9) 



de 



a(v. 



dC{4>,V^,x) 



X 



We now observe that ( 1.0.9 ) is equal to for all variations 'ip if and only if the term in large brackets 



on the right-hand side of (|1.0.9|) must be 0. Since this observation holds for any compact subset A, 

□ 



we have thus shown that (1.0.7) holds if and only if is a stationary point of the action. 



Example 1.0.2. Let C 



def 1 



m 



-l\af 



Va4>^l34> (note that this £ does not directly depend on x) 



where m ^ = diag(— 1, 1, 1, ■ ■ ■ , 1) is the standard Minkowski metric. Then 



(1.0.10) 
(1.0.11) 



g>C(0,V0) 



[m 



Therefore, the Euler-Lagrange equation corresponding to C is 



(1.0.12) 



V^((m-i)^"V. 



Note that equation (1.0.12) is just the familiar linear wave equation (m ^)"^VaV/30 = 0. 



2. Coordinate Invariant Lagrangians 

Many important PDEs are the Euler-Lagrange equations corresponding to coordinate invariant 
Lagrangians; we will explain what this means momentarily. Motivated by this claim, we will now 
introduce a class of changes of coordinates on spacetime. The new coordinates will be formed by 
fiowing the old coordinates in the direction of a vectorfield Y on spacetime. These new coordinates 
will therefore verify a system of ordinary differential equations generated by the fiow of Y. In the 
next proposition, we review some facts concerning these new coordinates; these facts are basic 
results in ODE theory. 
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Proposition 2.0.1 (Basic facts from ODE theory for autonomous systems). Let Y{x) = 
■ ■ ■ , x"), F^(a;°, ■ ■ ■ , x"), ■ ■ ■ , ■ ■ ■ , x")) be a smooth vectorfield on ]R^+". Assume that 



there exists a uniform constant C > such that 



(2.0.13) 



\Vf,Y%x)\<C, xe 



)l+n 



< fi,!/ < n. 



Consider the initial value problem (where the independent variable is the "flow parameter" e) for 
the following system of ordinary differential equations: 



(2.0.14) 
(2.0.15) 



d 
de 



x^'(e) 



Y^'(x) 



J^(0) = x^. 



Then there exists a number eo > such that the initial value problem (2.0.14) - (2.0.15) has a 
unique smooth (in e) solution existing on the interval e G [— eo,eo]. 

Let us denote the 'flow map" from the data x to the solution x at flow parameter ebyx = F^{x). 
Then on the interval [— eo,eo]; the flow map 



(2.0.16) 



, TP /■ \ def ^ 
X — )■ -Tf X = X. 



is a smooth (in x), bijective map from ]R^+" to ]R^+" with smooth inverse F_^{-), i.e., x = F^{x) =^ 
X = -F_e(x); such maps are called diffeomorphisms ofM}^^. Furthermore, if\€i \ + |e2| < cq, then 
the flow map verifies the following one-parameter commutative group properties: 



F o F = F o F = F : 



(2.0.17) 

Let us also denote the derivative matrix corresponding to the flow map F^ by M 



(2.0.18) 



def 



dx^ 
dx^ 



Then if \e\ is sufficiently small, we have the following expansions in e : 



(2.0.19) 
(2.0.20) 

(2.0.21) 
(2.0.22) 



x'^ 



def 



F^{x) =x'' + eF'^(x) + e27^'^(e, x), 



= + eV,F^(x) + t^V,^Z^'{e, x), 



dx^ 
dx'^ 



5i:-eV,Y^ix) + e^Si^{e,x), 



detM-^ = 1 - eV„F" + e^5(e, x). 



Above, 7^'^(e,x), V^7e'^(e,x),5^(e,x), S{e , x) are smooth functions of (e,x) for e G [ — eo,eo], x G 



Remark 2.0.1. The assumption (2.0.13) guarantees that the "time of existence" eo can be chosen 



to be independent of the initial data x. 



MATH 18.152 COURSE NOTES - CLASS MEETING # 21 



5 



Proof. Most of the results of Proposition 2.0.1 are standard facts from ODE theory and will not 



be proved here. We will show how to derive the expansions (2.0.21) and (2.0.22) from the other 
results. To this end, we will need some basic facts from matrix theory. We will use the following 
norm for (1 + n) x (1 + n) matrix- valued functions on M^"*""- : 



(2.0.23) 



I Mil =^ max 



J2 



Now if / is the 1 + n identity matrix]^ and \\A\\ is a sufficiently small (1 + n) x (1 + n) matrix, then 
the matrix M = {I - A)-^ can be expanded in a convergent series: 

(2.0.24) {I - A)-^ = I + A + + + ■■■ . 

Note in particular that the tail (i.e., all but the first two terms) can be bounded by 

(2.0.25) 11^2 + A^ + A^ + ---\\ = \\A\I - Ay^W < 2\\A\\\ 

if II A II is sufficiently small. We now apply (2.0.24) and ( |2.0.25[ ) to the matrix M defined in (2.0.18) 



(where A^^ =^eViyF'^), thereby arriving at (2.0.21). 

To derive (2.0.22), we first Taylor expand the determinant (viewed as a real- valued function of 
matrices) for sufficiently small ||y4|| : 

(2.0.26) det(/ + A) = 1 + A° + 0(||A||2) 

Above, we write 0(||A|p) to denote a term that can be bounded by C||A||^, where C > is some 



positive constant independent of (all sufficiently small) A. The expansion (2.0.22) now follows from 



(|2.0.21|) and (|2.0.26). We remark that you will derive the expansion (2.0.26) in your homework in 

□ 



more detail. 



We will now "define" how various fields and their derivatives transform under a change of coordi- 
nates. A full justification of these definitions can be found in books on tensor analysis or differential 
geometry. 

Definition 2.0.6 (Transformation properties of fields). Let 0(x) be a scalar- valued function, 
let m(x) be an (invertible) metric (depending on x) with components mn^{x), and let a; — )■ x be a 
spacetime diffeomorphism. Then upon changing coordinates x — )■ x, these quantities transform as 
follows: 



(2.0.27a) 







01 (xox) ) 


(2.0.27b) 








(2.0.27c) 








(2.0.27d) 


{m-X" 




M^\, ~ 



'\xoxj 

_ rn I 
)u I (xox) ''''aP I (xox) ) 

{m-^T^\(^ox). 



^Note that 11^ = 5i^. 
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def 



d 



Above and throughout, we use the notation 

(2.0.28a) ^ 
(2.0.28b) ^ 
j^/x def 1^ derivative matrix defined in (2.0.18), and (M 



del d 



dxt^ ' 



||(^ is its inverse. Furthermore, 



the notation x ox indicates that we are viewing a; as a function of x; this is possible since x — )■ a; is 
a diffeomorphism. 



Remark 2.0.2. (2.0.27a) simply says that the transformed function takes the same value at the 



new coordinate x that takes at the old coordinate x. (2.0.27b ) is really just the chain rule expressing 



dxt^ ■ 



(2.0.27c) is the standard transformation law for tensors with two upstairs 



in terms of 

indices. These transformation laws generalize to other tensors in a straightforward fashion; the 
generalization can be found in books on tensor analysis/ differential geometry. Roughly speaking, 
tensors with indices downstairs transform by multiplication by the matrix (one copy of 
for each index), and tensors with indices upstairs transform by multiplication by the matrix M (one 
copy of M for each index). 

We will now define what it means for a Lagrangian to be coordinate invariant. 

Definition 2.0.7 (Coordinate invariant Lagrangian). Let £(0, V0, m) be a Lagrangian that 
depends only on 0, V0, and the Minkowski metric m. We say that C is coordinate invariant if for 
all spacetime diffeomorphisms x ^ x, we have that 



(2.0.29) ^(Pix), V0(x), m(x)) = £(0(J), V0(x), m(x)), 

where the transformed fields are defined in Definition 12.0.61 



Example 2.0.3. Consider the Lagrangian for the linear wave equation: £(0, V0, m) 
Using (|2.0.27aD - (|2.0.27cD and the fact that M^{M-^Y = Sfl we compute that 



(m 



'')^"V^0V,^ 



(2.0.30) 



£(0,V0,m) ^=^^--(m-i^^- 



V^0V,v 



(M-^)^V.0(M-1)JVa^ 



= -^(m-i)^^V^0V,0. 
This Lagrangian is therefore coordinate invariant. 

As we will see, the availability of an energy-momentum tensor for certain Euler-Lagrange equa- 
tions is closely connected to the coordinate invariance property of their Lagrangians. In order to 
derive this connection, we will need to understand more about how the coordinate transformations 



(2.0.16) vary with e. 



^Recall that (5„ = 1 if a = k and 5^ = 1 if a ^ k; (5^ can be viewed as the identity matrix. 
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Proposition 2.0.2 (Derivatives with respect to the flow parameter e). Let = F^{x) 



be the change of spacetime coordinates defined in (2.0.16), and let (p, Vf^(j), m^y, {m ^)^'^ be the 



transformed fields defined in Definition 2.0.6 Then the following identities hold for all spacetime 
points X : 



(2.0.31a) 



e=0 



(2.0.31b) V^0b = -V^Y'^l^VaMx - l^lsV^V^^Is = -V^(r"V,0)|3., 

e=0 



(2.0.31c) 
(2.0.31d) d. 



e=0 



Tf^iiv\x ^ua\x^ uY \x ^aalx^uY \x 



^ Ix y a"''^u I X 



[m 



(m 



-l\au 



e=0 



for the Minkowski metric 



for the Minkowski metric 



(2.0.31e) d,\,=odetM-^\x = -V„r"|s. 



Above and for the remainder of these notes, denotes the derivative of an e— dependent quantity 
with the new coordinates x held fixed. 

Remark 2.0.3. In the language of differential geometry, the tilded fields are the Lie derivatives of 
the un-tilded fields with respect to the vectorfield —Y. 

Proof. Recall that = F^{x), x^ = Fl^^[x), Fq{x) = x^ (so that x = x when e = 0) and 
deF^{-) = Yf^^-). Therefore, using the chain rule, we compute that 



(2.0.32) 



We have thus shown (2.0.31a). 



Similarly, with the help of ( |2.0!2l| , and noting that {M'^Y^ = 6"^ when e = and d,\^=o [{M"'^)'^ o 
\x = — V^F"!^, we compute that 

(2.0.33) 

9e|e=0V^^(x) = d,U=o{{M-X ° F-e(^)(V„0) O F_,(x)} 

= {d,U=o [{M~X ° F^e{x)]]{Va<P) o F_,(J) + {M-X ° F_,(J)a,|,=o(V,0) o F^,{x) 
= -V^Y%V^<P\x - {M-X\xY^\sVpVa<P\x. 



We have thus shown we have thus shown (2.0.31b). The proofs of (2.0.31c) and (2.0.31d) are similar, 
and we omit the details. 



To prove (2.0.31e), we simply differentiate the expansion (2.0.22) with respect to e and set e = 0. 

□ 



We now state the following simple corollary to Proposition 2.0.2 
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Corollary 2.0.3 (The derivative of L with respect to the flo w par ameter e). Let £(0, V0, m) 
he a Lagrangian. Then under the assumptions of Proposition 2.0.2 . the following identity holds 
at all spacetime points: 



(2.0.34) 



9,|,=O'^(0, V0,m) 



dC{(f), V(/), m) 
dC{(p, V0, m) 
dC{(p, V0, m) 



dm 



v^(y"Va0) 



Proof. By the chain rule, we have that 



(2.0.35) 



~ ~ ^ , dC{(f), V0, m) 
d,£{(f),V(f),m) = — d. 



9m 



The relation (2.0.34) now follows from Proposition 2.0.2 and (2.0.35). 



□ 



3. The energy-momentum tensor 

The main goal of this section is to show that for a certain class of coordinate invariant Lagrangians 
£, there exists an energy-momentum tensor T^^ . This T^'^ plays the same role in the analysis of the 
corresponding Euler-Lagrange equation corresponding to £ as it did in our previous analysis of the 
linear wave equation. More precisely, for solutions to the Euler-Lagrange equation corresponding 
to £, we will show that V ^T^'^ = 0. As we saw earlier in the course, this identity forms the basis 
for the derivation of conserved quantities in solutions to the Euler-Lagrange equations. 

Theorem 3.1 (Derivation and divergence-free property of the energy- momentum ten- 



sor). Let C{(j),W(j),m) be a coordinate invariant Lagrangian (in the sense of Definition 2.0.7) that 
depends only on (p, V0, and the Minkowski metric m. Let 



(3.0.36) 



he the energy-momentum tensor corresponding to C Then T^'^ is symmetric: 



(3.0.37) 



rpflU rp 



UfJ, 



< fi,u < n. 



Furthermore, if (j) verifies the Euler-Lagrange equation (1.0.7), the following divergence identity is 
verified by T^^ : 



(3.0.38) 



V^T^^ = 0, (z/ = 0,l,2,--- ,n). 
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)l+n 



— 



Proof. The relation (3.0.37) follows easily from (3.0.36) since 

We will now prove (3.0.38). To this end, let ^ C M}^"' be a compact spacetime subset, and let 



be a smooth vectorfield with support contained in A. Let x be the change_o. 



Y : 

variables (2.0.16), and consider the transformed quantities 0, V0, m, given in Definition 



2.0.6 



Now by assumption, we have that £(0, V0, m) = £(0, V0, m). Furthermore, by the standard change 
of variables theorem from advanced calculus, we have that d}^^x = det'r^d^^^'x = detM~^(i^+"x, 



where the matrix M is defined in (2.0.18). Therefore, we have that 



(3.0.39) 



A[<p] f C{^, V0, m) 



£(0, V0,m) detM-^^+'^J. 



Now the left-hand side of (3.0.39) doesn't depend on e. We therefore have that 

d 



(3.0.40) 



de 



\e=0 



^[0; Si] = 0. 



On the other hand, we can differentiate under the integral on the right-hand side of (3.0.39) with 



respect to e at e = and use (2.0.31e) plus Corollary 2.0.3 (together with the fact that x = x when 
e = 0) to deduce that 



d I 
de 



dC{(l),V(f>,m)^ 



dC{(f), V0, m) 



(3.0.41) f |.=o^[0; Si] = I - ^^^"y7'"^^ y"V.0 - ^^^7;^^f^V^(F"V„0) d'+^x 

Ja V0 9(V^0) 



9£(0, V0, m) 



dm 



III/ 



a; 



- / £(0, V0,m)V„F"d^+"x. 



Integrating by parts in (|3.0.4l|) and using (|3.0.40|), we have that 
(3.0.42) 







r 9£(0, V0, m) 



I V0 

(9£(0, V0, m) 



/ g£(0,V0,m) \ 
V 5(V^0) 



- I C{(t),V(t),m)VaY"d^+'^x. 
J a 



We now note that the Euler-Lagrange equation (1.0.7) implies that the first line on the right-hand 



side of (3.0.42) is 0. Therefore, we collect the remaining terms together to derive that 



(3.0.43) = - ^ + \{m-'Y''C{<P, V0, m) \ {m^^V + m^^V.F"} d'^^x 



dm 



fii/ 



[ |2^^(|Z^ + (,n-i)/^^£(0, V0,m)|m„.V,y"rf^+-a:. 
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Integrating by parts in (3.0.43), we deduce that 



(3.0.44) 



dm 



Since (3.0.43) must hold for all such smooth vectorfields Y with support contained in A, we 



conclude that the divergence of the term in braces is 0: 



(3.0.45) 



9£(0, V0, m) 



+ (m-^)^^£(0, V0,m) } = 



We have thus shown that (3.0.38) holds 



□ 



Example 3.0.4. The Lagrangian for the linear wave equation is £ = — |(m 



-i)"^V«0V^0. We 



therefore appeal to ( |3.0.36[ ) and calculate that 



(3.0.46) 



dm 



(m 



flU 



m 



m 

2^ 



m 



Remark 3.0.4. To derive (3.0.46), we have used the fact that if q is any quantity, and m is a 



symmetric invertible (1 + n) x (1 + n) matrix that depends on g, then 



(3.0.47) 



dq 



[m 



(m-^Y'^im-^Yl^ 



dq 



-m. 



a(5, 



< fi,^ < n. 



You will derive the simple relation (3.0.47) in your homework. In particular, it follows from (3.0.47) 
that 



(3.0.48) 



d{m' 



9m K 



-{m-')^^{m-y\ 



On the left-hand side of (3.0.48), we are viewing the components (m ^)^'^ as functions of the 
components m^A, < n, \ < n. 
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